Geometric (Adaptive) Beamforming


• What it is

- Microphone array with fixed geometric configuration

- Adaptive algorithms to steer and adjust the beam pattern

• Typical Applications

- Attenuation of jammers

- Source localization

• Problems/Issues

- Requires known/fixed array configuration

- Cannot handle multiple sources

- Signal leakage, reverberation


[Figure: microphone array (mic.) and source]
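To make the "fixed geometric configuration" idea concrete, here is a minimal sketch of a delay-and-sum beamformer. It is a simpler, non-adaptive stand-in for the adaptive algorithms named above, and it assumes that the per-microphone delays (the list `delays`, a hypothetical value here) are already known from the array geometry and are whole numbers of samples.

```python
import numpy as np

def delay_and_sum(signals, delays):
    """Align each channel by its integer sample delay, then average.

    signals: array of shape (n_mics, n_samples)
    delays:  per-microphone arrival delay in samples (assumed known)
    """
    n_mics, n_samples = signals.shape
    out = np.zeros(n_samples)
    for channel, d in zip(signals, delays):
        # Advance the channel by d samples so the target lines up across mics.
        out[: n_samples - d] += channel[d:]
    return out / n_mics

rng = np.random.default_rng(0)
n = 2000
source = rng.standard_normal(n)
delays = [0, 3, 6, 9]                        # hypothetical geometry-derived delays
mics = np.zeros((4, n))
for m, d in enumerate(delays):
    mics[m, d:] = source[: n - d]            # delayed copy of the source
    mics[m] += 0.5 * rng.standard_normal(n)  # independent sensor noise

beam = delay_and_sum(mics, delays)

# Compare the error of one microphone with the error of the beam output.
valid = slice(0, n - max(delays))
err_single = np.mean((mics[0][valid] - source[valid]) ** 2)
err_beam = np.mean((beam[valid] - source[valid]) ** 2)
print(err_single, err_beam)
```

Averaging aligned channels reduces uncorrelated noise, but the sketch only works because the delays are given, which is the "known/fixed array configuration" requirement listed under Problems/Issues.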
Statistical (Blind) Beamforming


• What it is

- Multiple sensors at arbitrary locations

- Adaptive algorithm to recover independent/decorrelated source signals

• Typical Applications

- Simultaneous recovery of multiple sources

- Jammer attenuation under reverberation and target signal leakage

• Problems/Issues

- Requires low-noise sensors

- Computational complexity
Recovering Speech from Simultaneous Recording

(Statistical Beamforming Demo)

... isolating individual speakers with multiple microphones ...

• Instantaneous mixture - corresponds to an environment with no reverberation and known time delays.

• Solution can’t assume knowledge of speaker/source location - requires a “blind” algorithm.

• Demo 1 - linear mixing of 10 speakers.

• Demo 2 - “real-world” deconvolution with 2 speakers.

• Sarnoff algorithm exploits non-stationarity of the speech signal, performing multiple decorrelation across time to compute a matrix of “unmixing” FIR filters.
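The idea of decorrelating at multiple times can be illustrated on the simpler instantaneous case. The sketch below is not the Sarnoff algorithm (which computes FIR filters for convolutive mixtures). It assumes a noise-free 2x2 mixture with a hypothetical mixing matrix `A`, and synthetic sources whose power changes between two time blocks. With covariances R1 = A L1 A^T and R2 = A L2 A^T (L1, L2 diagonal), the eigenvectors of R2 R1^{-1} recover the columns of A up to scale and order.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 20000

# Two non-stationary sources: their power differs between the two halves.
s = rng.standard_normal((2, n))
s[0, : n // 2] *= 3.0     # source 0 is loud in the first half
s[1, n // 2 :] *= 3.0     # source 1 is loud in the second half

A = np.array([[1.0, 0.6], [0.4, 1.0]])   # hypothetical mixing matrix
x = A @ s                                 # instantaneous mixture, no noise

# Covariance of the mixtures in each time block.
R1 = np.cov(x[:, : n // 2])
R2 = np.cov(x[:, n // 2 :])

# R2 R1^{-1} = A (L2 L1^{-1}) A^{-1}: its eigenvectors are the columns of A
# up to scale and order, so their inverse is an unmixing matrix.
_, V = np.linalg.eig(R2 @ np.linalg.inv(R1))
W = np.linalg.inv(V.real)

# W @ A should be close to a scaled permutation matrix.
P = np.abs(W @ A)
P /= P.max(axis=1, keepdims=True)
print(np.round(P, 3))
```

Each row of `P` should have one entry equal to 1 and the other close to 0. If the sources were stationary, L1 and L2 would be proportional and the eigenvectors would no longer be determined, which is why the non-stationarity matters.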
The Problem: Convolutive Mixture


Acoustic signals $x(t)$ recorded simultaneously in a reverberant environment $A(\tau)$ can be described as sums of differently convolved sources $s(t)$:

$$ x(t) = \sum_{\tau \geq 0} A(\tau)\, s(t - \tau) + n(t) $$

with $\dim(x) \geq \dim(s)$.
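The mixing model can be simulated directly. The following is a minimal sketch, assuming finite-length mixing filters stored as an array `A[tau]` of shape (taps, microphones, sources) and zero source signal before t = 0. The function name `convolutive_mix` and the random decaying filters are illustrative, not taken from the slides.

```python
import numpy as np

def convolutive_mix(A, s, noise_std=0.0, rng=None):
    """x(t) = sum_tau A[tau] @ s(t - tau) + n(t).

    A: array (n_taps, n_mics, n_sources), the mixing filters A(tau)
    s: array (n_sources, n_samples)
    """
    n_taps, n_mics, n_sources = A.shape
    n_samples = s.shape[1]
    x = np.zeros((n_mics, n_samples))
    for tau in range(n_taps):
        # Shift the sources by tau samples (zeros before t = 0), then mix.
        delayed = np.zeros_like(s)
        delayed[:, tau:] = s[:, : n_samples - tau]
        x += A[tau] @ delayed
    if noise_std > 0:
        rng = rng or np.random.default_rng()
        x += noise_std * rng.standard_normal(x.shape)   # sensor noise n(t)
    return x

rng = np.random.default_rng(0)
s = rng.standard_normal((2, 1000))                      # two sources
decay = np.exp(-np.arange(8))[:, None, None]            # toy reverberation tail
A = rng.standard_normal((8, 3, 2)) * decay              # three microphones
x = convolutive_mix(A, s, noise_std=0.01, rng=rng)
print(x.shape)
```

Here dim(x) = 3 and dim(s) = 2, so the condition on the dimensions is satisfied.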
Context on Blind Source Separation


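PCA:

$$ x = R s $$
$$ R: \quad \hat{s} = R^{T} x $$

ICA:

$$ x = A s $$
$$ A: \quad \hat{s} = A^{-1} x $$
$$ W: \quad \hat{s} = W x $$

BSS:

$$ x(t) = A * s(t) $$
$$ A: \quad \hat{s}(t) = A^{-1} * x(t) $$
$$ W: \quad \hat{s}(t) = W * x(t) $$

($*$ denotes convolution.)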


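$$ \langle \hat{s}_i\, \hat{s}_j \rangle = \delta_{ij} \lambda_i $$

$$ \langle \hat{s}_i(0)\, \hat{s}_j(t) \rangle = \delta_{ij} \lambda_i(t) $$

[Third condition illegible in the source]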
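A small numerical sketch may help separate the PCA case from the others. It assumes a hypothetical 2x2 instantaneous mixing matrix `A` and synthetic Gaussian sources. PCA rotates with the eigenvectors R of the covariance of x, which makes the outputs uncorrelated, but the product R^T A is generally not a scaled permutation, so the outputs are still mixtures of the sources.

```python
import numpy as np

rng = np.random.default_rng(0)
s = rng.standard_normal((2, 5000)) * np.array([[2.0], [0.5]])
A = np.array([[1.0, 0.5], [0.3, 1.0]])   # hypothetical mixing matrix
x = A @ s

# PCA: rotate with the eigenvectors R of the covariance of x.
Rxx = np.cov(x)
eigvals, R = np.linalg.eigh(Rxx)
s_pca = R.T @ x

# The rotated outputs are uncorrelated: their covariance is diagonal.
C = np.cov(s_pca)

# Global system from sources to outputs. For a true separation each row
# would have a single dominant entry.
G = np.abs(R.T @ A)
ratio = G.min(axis=1) / G.max(axis=1)
print(np.round(C, 6))
print(np.round(ratio, 3))
```

A large value in `ratio` means that output still contains a substantial share of both sources, even though the covariance is diagonal. Decorrelation at a single time is therefore not enough, which motivates the additional conditions used by ICA and BSS.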
Approach - Use Non-stationarity


Measure the time-dependent second-order statistic

$$ R_x(\omega, t) = \frac{1}{N} \sum_{n=0}^{N-1} x(\omega, t + nT)\, x^{H}(\omega, t + nT) $$

where $x(\omega, t)$ are the frequency components of the frame $[x(t), \ldots, x(t + T)]$.
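The statistic can be estimated with a short-time DFT. The sketch below is one possible reading under stated assumptions: frames of exactly T samples, no overlap, no window function, and an average over N consecutive frames starting at sample t. The function name `cross_power_spectra` is illustrative.

```python
import numpy as np

def cross_power_spectra(x, T, N, t=0):
    """Average x(w, t + nT) x^H(w, t + nT) over N consecutive frames.

    x: array (n_mics, n_samples); T: frame length; N: frames to average.
    Returns an array of shape (T, n_mics, n_mics), one matrix per DFT bin.
    """
    n_mics, n_samples = x.shape
    if t + N * T > n_samples:
        raise ValueError("not enough samples for N frames of length T")
    R = np.zeros((T, n_mics, n_mics), dtype=complex)
    for n in range(N):
        frame = x[:, t + n * T : t + (n + 1) * T]
        X = np.fft.fft(frame, axis=1)                # shape (n_mics, T)
        # Outer product x x^H for every frequency bin at once.
        R += np.einsum("if,jf->fij", X, X.conj())
    return R / N

rng = np.random.default_rng(0)
x = rng.standard_normal((2, 4096))
x[:, 2048:] *= 4.0                                   # power changes over time

R_early = cross_power_spectra(x, T=64, N=8, t=0)
R_late = cross_power_spectra(x, T=64, N=8, t=2048)
print(R_early[5].real.trace(), R_late[5].real.trace())
```

Evaluating the estimate at several start times t gives a set of matrices per frequency. For a non-stationary signal these differ, and that difference is what a multiple-decorrelation approach exploits.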


Experimental  Setup:  Speaker  with  Interfering  Source 


Reverberant environment (small office room). The interfering signal was a competing speaker or music.

Left: a = b = 50 in, c = 50 in, 6 in.

Right: b = 30 in, c = 60 in, α = 45°, 180°.
Multiple Microphone Performance


Performance of multiple microphones in a simulated room (small office) for separating a speaker from a music background. Microphone distance: 2 m.

[Figure: plot over number of microphones, 2 to 10]
Separation Performance of two sources in simulated room
Speech Recognition Improvement

[Figure: bar charts of WER (before, after, clean) for music and for speech interference, each in four configurations: opposite 45 deg, opposite 180 deg, parallel 50 in, parallel 6 in]


Word error rate (WER) of ViaVoice (IBM) on a short text (a Wall Street Journal article, 760 words long) before and after source separation. The result is contrasted with a clean recording with no interfering source.

Up to 50% reduction in word error rate for IBM ViaVoice.
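For reference, word error rate is commonly computed as the word-level edit distance between the reference text and the recognizer output, divided by the number of reference words. The sketch below assumes that standard definition with whitespace tokenization. How the ViaVoice results above were scored is not stated in the source, and the example sentences are made up.

```python
def word_error_rate(reference, hypothesis):
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    # prev[j] = edit distance between the ref words seen so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            cur.append(min(prev[j] + 1,               # deletion
                           cur[j - 1] + 1,            # insertion
                           prev[j - 1] + (r != h)))   # match or substitution
        prev = cur
    return prev[-1] / len(ref)

def relative_reduction(wer_before, wer_after):
    """Fraction by which the WER dropped, e.g. 0.5 for a 50% reduction."""
    return (wer_before - wer_after) / wer_before

reference = "the cat sat on the mat"
before = word_error_rate(reference, "a cat sat in the hat")   # 3 errors in 6 words
after = word_error_rate(reference, "the cat sat on the hat")  # 1 error in 6 words
print(before, after, relative_reduction(before, after))
```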